A Sentence-pitch-contour Generation Method Using Vq/hmm for Mandarin Text-to-speech

نویسندگان

  • Hung-Yan GU
  • Chung-Chieh YANG
چکیده

In this paper, a method with sentence-wide optimization consideration is proposed to generate a Mandarin sentence's pitch-contour. The developed model is called the sentence pitch-contour HMM (SPC-HMM) due to its use of VQ (vector quantization) and HMM (hidden Markov model). To construct an SPC-HMM, the pitch-contours of the syllables from each training sentence are normalized on both time and pitch-height first. The method for pitch-height normalization is effective and newly developed here. After normalization, the pitch-contour of each training syllable is vector quantized. Then, the quantization code and lexical tones of adjacent syllables are combined to define the observation symbol sequences for HMM training. In the synthesis phase, when given a sentence and its relevant text-analysis information, the most probable observation sequence is generated by finding the sentence-wide largest probability path with a dynamic-programming based algorithm. We had conducted practical perception tests. It is found that the speech synthesized by using the sentence pitch-contour generated from out method is slightly better than uttered by an ordinary speaker. Besides, the comprehensibility of the synthesized speech is also promoted.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An HMM Based Pitch-Contour Generation Method for Mandarin Speech Synthesis

In this paper, a method is proposed to generate pitch-contours for Mandarin speech synthesis. In this method, an HMM (hidden Markov model) is used to model the prosodic states implicitly stayed and a syllable’s pitch-contour is treated as an observation generated from a prosodic state. Such an HMM is called a syllable pitch-contour HMM (SPC-HMM). For training the SPC-HMM, we developed a feasibl...

متن کامل

Hierarchical stress modeling and generation in mandarin for expressive Text-to-Speech

Expressive speech synthesis has received increased attention in recent times. Stress (or pitch accent) is the perceptual prominence within words or utterances, which contributes to the expressivity of speech. This paper summarizes our contribution to Mandarin expressive speech synthesis. A novel hierarchical stress modeling and generation method for Mandarin is proposed and further integrated i...

متن کامل

Modeling Pitch Contour of Chinese Mandarin Sentence with PENTA Model

In continuous speech, it is believed that the pitch contour of the same syllable may vary a lot due to its different context information. To apply the Parallel Encoding and Target Approximation (PENTA) model to Mandarin speech synthesis and improve its prediction accuracy, this paper proposed a method to predict pitch contours for Chinese syllables with different contexts by combining the Class...

متن کامل

Modeling Pitch Contour of Chinese Mandarin Sentences with the PENTA Model

In continuous speech, the pitch contour of the same syllable may vary much due to its contextual information. The Parallel Encoding and Target Approximation (PENTA) model is applied here to Mandarin speech synthesis with a method to predict pitch contours for Chinese syllables with different contexts by combining the Classification And Regression Tree (CART) with the PENTA model to improve its ...

متن کامل

Modelling and Decision Tree Based Prediction of Pitch Contour in Ibm Mandarin Speech Synthesis System

In this paper, a method of pitch contour modelling based on the hidden Markov model (HMM) states of an acoustic unit is presented. A pair of vectors is computed from the alignment of the speech data with the acoustic unit’s HMM states. The pitch contour feature of the acoustic unit is represented by the vector pair so that the variants of the acoustic unit’s pitch contour can be measured and co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000